- preprint
Scaling World-Model Reinforcement Learning Through Diffusion Policy Optimization
Xiaoyuan Cheng, Wenxuan Yuan, Zhancun Mu, Yuanzhao Zhang, Yiming Yang, Hai Wang, Zhuo Sun, Che Liu
arXiv preprint • 2026
- conference
Optimizing Latent Goal by Learning from Trajectory Preference
Guangyu Zhao, Kewei Lian, Haoxuan Ru, Borong Zhang, Haowei Lin, Zhancun Mu, Haobo Fu, Qiang Fu, Shaofei Cai, Zihao Wang, Yitao Liang
ICML 2026 • 2026
- preprint
Preserve Support, Not Correspondence: Dynamic Routing for Offline Reinforcement Learning
Zhancun Mu, Guangyu Zhao, Yiwu Zhong, Chi Zhang†
Preprint • 2026
- preprint
DeFlow: Decoupling Manifold Modeling and Value Maximization for Offline Policy Extraction
Zhancun Mu
arXiv preprint • 2026
- preprint
ROCKET-3: Scalable Multi-Task RL for Generalizable Spatial Intelligence in Visuomotor Agents
Shaofei Cai*, Zhancun Mu*, Haiwen Xia, Bowei Zhang, Anji Liu, Yitao Liang
arXiv preprint • 2025
- preprint
MineStudio: A Streamlined Package for Minecraft AI Agent Development
Shaofei Cai*, Zhancun Mu*, Kaichen He, Bowei Zhang, Xinyue Zheng, Anji Liu, Yitao Liang
arXiv preprint • 2024
- conference
Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning
Haoqi Yuan, Zhancun Mu, Feiyang Xie, Zongqing Lu
ICLR 2024 (Oral) • 2024